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Abstract 

We consider the following random graph process: starting with n isolated vertices, 
add edges uniformly at random provided no such edge creates a copy of C4. We show 
that, with probability tending to 1 as n — > 00, the final graph produced by this process 
has maximum degree 0((n log n) 1 / 3 ) and consequently size 0(n 4 / 3 log(ra) 1 / 3 ), which 
are sharp up to constants. This confirms conjectures of Bohman and Keevash and of 
Osthus and Taraz, and improves upon previous bounds due to Bollobas and Riordan 
and Osthus and Taraz. 



1 Introduction 

The H-free process, where if is a fixed graph, is the random graph process which begins 
with a graph C7(0) on n isolated vertices. The graph G(i) is then formed by adding an edge 
Ci selected uniformly at random from the pairs which neither form edges of G(i — 1) nor 
create a copy of H in G(i — 1) + e^. The process terminates with a maximal if-free graph 
G(M) with M = M(H) edges. 

Erdos, Suen and Winkler [6] suggested this process as a natural probability distribu- 
tion on maximal if-free graphs, and asked for the typical properties of G(M(H)), such as 
size and independence number. They considered the odd-cycle-free and triangle-free pro- 
cesses, establishing that the former terminates with 0(n 2 ) edges with high probability 1 , 
and that, for some positive constants ci, 02,03, w.h.p. cqn 3 / 2 < M(K 3 ) < C2n 3 / 2 (logn) and 
a{G{M{Kz))) < C3A/n(logn). These bounds were improved by Spencer [13], who further 
conjectured that w.h.p. M(K 3 ) = G(??, 3 / 2 i/logn). (We mention that the earliest result on 
an H-iree process is due to Rucifiski and Wormald [11], who that the maximum- degree d 
process terminates in a graph with \nd/2\ edges with high probability - here H is the star 
graph K x4+ i.) 

More general H-iree processes, where H satisfies an additional density condition, were 
first studied by Bollobas and Riordan [5] and by Osthus and Taraz [9] independently. For 
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the remainder of this section, the bounds we mention are assumed to hold w.h.p. unless 
stated otherwise. We say a graph H is 2-balanced if e(H) > 3, v(H) > 3, and 

e(H) - 1 e(F) - 1 
v(H) - 2 ~ v(F) - 1 

for all proper subgraphs F of H with v(F) > 3, and strictly 2-balanced if the inequality is 
sharp for all such F; examples of such graphs include cycles, complete graphs, and complete 
bipartite graphs K Tjr , r > 2. For this class of graphs, Bollobas and Riordan established 
general lower bounds on M(H), and upper bounds for H G {C4, K4} that match to within a 
logarithmic factor. Osthus and Taraz then gave upper bounds for all strictly 2-balanced H 
that match to within a logarithmic factor. For H = C4, the results of Bollobas and Riordan 
yield M(C 4 ) = fi(n 4 / 3 ) and M(C 4 ) = 0(n 4 / 3 (logn) 3 ); Osthus and Taraz's results improve 
the upper bound to M(d) = 0(n 4 ^ 3 logn), and they further conjectured that the average 
degree of the C;-free process is 0((n logn) 1 /^ -1 )) for all / > 3. Evidence that the lower 
bound was not sharp came from Wolfovitz [16] who improved the lower bound on E (M(H)) 
for regular strictly 2-balanced if by a factor of (log log n) 1 ^ 6 ^ -1 ^. 

Finally, through an application of the differential equations method (for the general 
method and examples, see [19]), Bohman [2] showed M(K 3 ) = Q(n 3 / 2 ^\ogn), confirming 
Spencer's conjecture, and produced an improvement on the lower bound for M(K&). Sub- 
sequent work by Bohman and Keevash [3] established new lower bounds on M(H) for all 
strictly 2-balanced H by producing lower bounds on the minimum degree of G(M(H)), and 
they conjectured that the likely maximum degree of G(M(H)) is at most a constant mul- 
tiple of their lower bound. For the C^-free process, their bound on the minimum degree is 
0((nlogn) 1 / 3 ), yielding M(C 4 ) = fi(n 4 / 3 (log n) 1 / 3 ). 

We mention some motivation for studying the if-free process comes in part from its 
connection to two classical areas of extremal combinatorics, Ramsey theory and Turan theory. 
Bounds on the independence number of G(M(K$j) found in [6] and [13] led to the best lower 
bounds on R(3, t) known at the time, and Bohman's analysis [2] produced an improvement 
that matched Kim's celebrated lower bound [8]. The analysis in [2] and [3] has also led to 
the best current lower bounds for the Ramsey numbers R(s, t), with s > 4 fixed and t large, 
and the cycle-complete Ramsey numbers R(Ci,K t ) for I > 4 fixed and t large. The results 
for the K rr -fiee process in [16] and [3] have resulted in improvements on the best known 
lower bounds for the Turan numbers ex(n, K r ^ r ) for r > 5. 

However, the process has also become a subject of recent interest on its own, in part for 
aspects of Bohman and Keevash's analysis of the strictly 2-balanced case that suggest the 
graph G(i) produced by the process resembles the random graph G(n,i), chosen uniformly 
at random from all i-edge graphs on n vertices, with the exception that it contains no 
copies of H. To establish their lower bound, they show that a wide range of subgraph 
extension variables, including the degree of a vertex and the number of copies of a given 
if- free graph F, take roughly the same values in G{i) as in G(n,i), for i up to a small 
multiple of n 2- ^^ -2 ^ 6 ^ -1 ) (logn) 1 ' t 6 ^ -1 -*. (Similar results on subgraph counts in the 
f\3-free process were obtained by Wolfovitz [18].) In fact, the lower bound and conjectured 
upper bounds on M(H) in [3] correspond (within constant factors) to the threshold for the 
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random graph G(n,i) to have the property that the addition of any new edge creates a copy 
of H, provided H is strictly 2-balanced (see [12]). It is also known (see [7] and [14]) that 
sufficiently dense subgraphs are unlikely to appear in the final graph G(M(H)). 

Very recently, Warnke [15] and Wolfovitz [17] have independently given upper bounds 
on M(K±) that match Bohman's lower bound to within a constant factor. The author [10] 
has also established similar bounds for the case where H is the diamond graph, formed by 
removing an edge from K4. (The diamond graph is 2-balanced but not strictly so.) Along 
with K 3 , these are the only 2-balanced graphs containing a cycle for which such bounds on 
M(H) are currently known. Our aim is to add C4 to this list through the next result. 

Theorem 1. There exists k > such that A.(G(M(C&))) < k(ji log(n)) 1 / 3 with high proba- 
bility. 

This confirms the mentioned conjectures of Osthus and Taraz and of Bohman and Keevash 
for the CVfree process. Combined with the lower bound given in [3], this has the following 
immediate corollary. 

Corollary 1. With high probability, M(C 4 ) = 6(n 4 / 3 (logn) 1 / 3 ). 

From an upper bound established in [3], as well as known bounds on the independence 
number of C^-free graphs with bounded maximum degree, we arrive at the next result easily. 

Corollary 2. With high probability, a{G(M{C A ))) = 0((nlogn) 2/3 ). 

An immediate consequence of this second corollary is that a typical graph produced by the 
C^-free process will not essentially improve the lower bound on R{C^ K t ) given in [3]. 

To establish our bound, we use a fairly simple observation: suppose we fix a vertex v 
and a step i < M = M(C±). If x and y are neighbors of v in G(M) but are nonadjacent to 
v in G(i), then x and y have no common neighbors in G(i). We can therefore establish an 
upper bound on A(G(M)) of the form A(G(i)) + k by showing that every set of k vertices 
contains two which share a neighbor in G(i). Thus, to prove Theorem 1, we simply need to 
make appropriate choices of i and k. 

The remainder of this paper is organized as follows: in the next section we discuss the C4- 
free process specifically, including relevant results from [3], and in Section 2.3 we introduce 
our main technical lemma (Lemma 1) and prove Theorem 1 and Corollary 2. Section 3 will 
cover a few preliminary results for our proof of Lemma 1, including a lemma from [3] which 
forms the basis for our differential equations method application, and the proof of Lemma 1 
will follow in Section 4. 

2 The C^-free process 
2.1 Definitions and notation 

We let [n] — {1, . . . , n} be the vertex set of the process, and G(i) the graph given by the first 
i edges selected by the process. G{i) naturally partitions (y) into three sets, E(i), 0(i), 
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and C(i). E(i) is simply the edge set of the process. For a pair uv ^ E(i), we say uv is 
open, and uv G 0(i), if the graph G(i) + uv is C^-free. Otherwise, we say uv is closed 
and uv G C(i). For t> G [n], we let Ni(v) and <ij(w) denote the neighborhood and degree, 
respectively, of v in G(i). 

For z > and a pair of vertices uv G (^') \ E(i), we define C w (i) to be the set of pairs 
wz G 0(i) such that G(i) + uv + wz contains a copy of C4 that uses both uv and as 
edges. Equivalently, C uv (i) is the collection of open pairs which, if added as edges, would 
create a path of length three between u and v. We mention that, in [3], C uv is defined as the 
set of ordered pairs; we will work exclusively with unordered pairs. 

We introduce a continuous time variable t, and relate it to the process by setting t = 
t(i) = i/n A /' s . We fix constants n,e, V, W, which satisfy 

0< / ,«e«i F «I«i 

(The notation < a ^ b means there is an increasing function f(x) so the arguments which 
follow are valid for < a < fib).) Given these constants, we define 

p = n - 2 /3 ; m = ^(logn) 1/3 -n 4/3 , and t max = /i(logn) 1/3 . (1) 

We further define functions q(t), c(t), P(t), e(t) as well as parameters s — s(n) and s e = s e (n) 
as follows: 

q{t) = exp(-8t 3 ), c(t) = 24t 2 exp(-8t 3 ), (2) 

P{ t) = W (t i + t), e(t) = e p(i) - 1, (3) 

s(n) = n 2 p = ?7, 4//3 , and s e (n) = n 1/s ~ £ . (4) 

We assume that e and \x are chosen sufficiently small that e(t) and are at most n £ 

for < t < t max , and s e = n 1 ^ -6 n £ , so e(t)/s e = o(l) (uniformly with respect to n) for 
<t < t max . We will discuss additional bounds on fi,e, V and W further in Section 3.3. 



2.2 The lower bound - results of Bohman and Keevash 

Bohman and Keevash [3] established their lower bound on the if-free process by showing 
that certain random variables are tightly concentrated throughout the initial m steps. As 
we do not require the full strength of their results, we summarize the relevant consequences 
for the C^free process in the following theorems. 

Theorem 2 (Bohman and Keevash, [3]). Let Ti* denote the event that the following hold for 
0<i< i*: 
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2. For all v G [n], 

di(v)= (l±^) Ut±^jn P , (6) 
and so A(G(i)) < 4t max np. 

3. For alluv G 0{i)UC{i), 

|CWi)|=(l±^) (W g (i)±£)^, (7) 

and for all distinct uv, u'v' G 0(i), 

\C uv (i)nC uV (i)\ Kn-^p- 1 (8) 

Then T m holds with high probability. 

Theorem 3 (Bohman and Keevash, [3]). With high probability, a(G(m)) < 3/i~ 1 (nlogn) 2 / 3 . 

Recalling that we may choose e and \i so that e(t)/s e = o(l) and q{t) > n e ^ 1 / s e , T m 
implies Q(m) > and consequently the lower bound M{C/^) > /in 4 / 3 (log n) 1 ^ 3 holds with 
high probability 

Equation (5) follows immediately from Theorem 1.4 of [3]. Equation (6) follows similarly, 
while the bound on A(G(i)) follows from bounding e(t)/s e above by 1/3 and l/s e above by 
t max . Equations (7) and (8) follow from Corollary 6.2 and Lemma 8.4 of [3], respectively. 
We mention that the phrasing of Lemma 8.4 suggests that uv and u'v' are fixed. However, 
as Lemma 8.4 is shown to be a consequence of a constant (depending on H) number of 
applications of Lemma 5.2, which has exponentially small failure probability (conditioned 
on their event Q m ), (8) follows for all such pairs and steps i, < % < m, with high probability. 
Finally, Theorem 3 follows from the proof of Theorem 1.9 of [3] (specifically, Lemmas 11.3 
and 12.1). 



2.3 The upper bound - proofs of Theorem 1 and Corollary 2 

Let (3 > be a fixed constant satisfying 

> 4' ( 9 ) 

and define 

k = (3 ■ (nlogn) 1/3 . (10) 

Our aim is to show that, in G(m), every /c-element subset of [n] contains two vertices that 
share a neighbor. We say a k-set K is covered in G(i) if a common neighbor exists in G(i) 
for some pair of vertices in K; K is uncovered otherwise. As previously mentioned, for 
any v G [n], no pair of vertices in Nm(c 4 )(v) \ N m (v) can have a common neighbor in G(m) 
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- if every A;- set is covered in G(m), this yields dM(c 4 )( v ) < d m (v) + k = 0((nlogn) 1//3 ), the 
desired bound. 

If a set K is covered in G(i), then there exists a triple of vertices u, v, w such that uv G (^) 
and uw, vw G £?(«)■ We note that the order of u and v is not essential, and that we expect 
that it is likely (but not necessary!) that the common neighbor w does not lie in K. We 
therefore restrict ourselves to considering certain subsets of (^) x ([n] \ if). We will write 
elements of (^) x ([n] \K) as (wt>, w) but will refer to them as triples to avoid confusion with 
elements of (y). We will also identify each such triple (uv,w) with the subset {uw,vw} of 

('?)• 

We introduce the following definitions: given K G (^) and < i < m, we define 

uw,vw G O(i) j> , and 

\{uw,vw} n 0(i)| = \{uw,vw} n £(z)| = 1 j . 

We call triples in Xx(i) open with respect to K, and triples in Yk{i) partial with 
respect to K. We note that if (uv, w) G Yx(i), where, without loss of generality, uw G 0(i), 
then if we select e i+ i = uw, K is covered in all steps i' > i + 1. Equally important, if K is 
uncovered in G(i), then every pair in 0(i) lies in at most one triple (uv,w) G Yk(i) (viewed 
as {uw,vw}), as otherwise w has at least two neighbors in K. 

Intuitively, the probability that a given pair of vertices xy is open at time t = t(i) is 
~ q(t), while the probability that a pair is an edge is ~ 2tp. It is then reasonable to suspect 
that 

1**001 « (2) •(«-*)• ?(^) 2 w ^y" ' and 

|y^(i)|«2rj(n-A;)(g(t))(2tp)«2t g (t).A;V 
The following lemma shows that these estimates are correct for uncovered K. 

Lemma 1. With high probability, for all i, <i < m, and K G ( t ), if K is uncovered in 
G(i) then 

\Xk(i)\ 
\Y K (i)\ 

Via an argument similar to that used to bound the independence number of the K r -free and 
C r -free processes in [2] and [3], we next show how Lemma 1 implies Theorem 1. 

Proof of Theorem 1. We may assume the conclusions of Theorem 2 and Lemma 1 hold, as 
their failure probability is o(l). We also assume that fi,e,V and W are chosen sufficiently 



X K (i) 
Y K (i) 



uv,w) G 
[uv, w) G 



x([n]\K): 
x([n]\K): 



1 ± 



e(t)\ q{t) 



1 ± 



n 3e 

e(t) 



n 



■?>■ 



± —t- k n, and 



n 



3e 



2tq(t) ± ^ ) k 2 np. 



n 
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small that q{t)~ l and e(t) are at most n £ on [0, t max ], and s e > n 3e . By Theorem 2, with 
high probability, A(G(m)) < 4/x(n logn) 1 / 3 . Letting k = 4/x + /3, we establish the bound in 
Theorem 1 by showing that w.h.p. every fc-set is covered in G(m). 

Given an uncovered K at step i, as ej+i is chosen uniformly at random from Q(i) open 
pairs, and as each partial triple (uv,w) G Yx(i) contains a unique open pair, the probability 
K remains uncovered in G(i + 1) is at most 1 — gffl ■ We restrict our attention to bounding 
the probability that some k-set K remains uncovered for all steps i, m/2 < i < m. For 
n sufficiently large, m/2 > n 4//3 , so in this range of i we may assume t = t(i) > 1. Thus, 
if K is uncovered in G(i) with m/2 < i < m, then as s e > n 3e , e(t)/n 3e < 1/3 and 
l/n 3e < q{t)/2 < tq(t)/2 for t max /2 <t< t max , from Theorem 2 and Lemma 1 we have 

\Y K (i)\ _ (1 ± e(t)/n 3£ )(2tq(t) ± l/n 3e )k 2 np 2/3 • (3/2)tq(t)k 2 np t mSLX k 2 p 



Q(i) (l±e(t)/s e )(q(t)±l/s e )n 2 /2 ~ 4/3 • (3/2)g(t)n 2 /2 " 2n 

Therefore, the probability that a k-set K exists which remains uncovered for all i, m/2 < 
i < m is at most 

n>\ ( \ t max k p\ t, / t m3x k pm 

1 1 < n exp 1 



k J V 2n I V 4n 



n k exp 



n k exp 



/i(logra) 1//3 (/3(nlogn) 1 / 3 ) 2 ■ n 2 ^ 3 ■ jj,n^ 3 {\.og(n)) 1 ^ 3 

An 

li 2 f3 2 n l l 3 (\ogn) 4 / 3 * 



As n fc = exp( / 9n 1 / 3 (logn) 4 / 3 ), this probability is o(l) provided /x 2 /3 2 /4 > /3, i.e. (9) holds. □ 

We note again that Corollary 1 follows immediately, so we turn to the proof of Corollary 2. 
The upper bound on a{G(M{C±))) follows from Theorem 3, as a(G(M(Ci))) < a{G{m)). 
For the lower bound, we apply a lemma from [4] bounding the independence number of 
graphs with few triangles. (Similar bounds are known for a wider class of H-hee graphs - 
see [1].) 

Lemma 2 ([4], Lemma 12.16 (ii)). Let G be a graph on n vertices with average degree at 
most d and at most h triangles. Then 

,^ In/ ,1 ( h 
a{G) > — ■ - log d- - log 



10 d V 2 \n 

Proof of Corollary 2. Letting M = M{C^, by Theorem 1, with high probability the average 
degree of G(M) is at most k = ft(nlogn) 1//3 , where k > is a fixed constant. As G(M) is 
C^free, each edge lies on at most one triangle, so G(M) has at most M/3 triangles. Taking 
d = k and h = kn in Lemma 2, and observing that \og(k) > log(n)/3 for n sufficiently large, 
we have 

a(G(M)) > 1 ■ ? (llogk) > -±-{n\ogn) 2 ' 3 . 
10 k \2 J 60k 

□ 
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3 Preliminaries 



We use the notation "±" in two distinct ways throughout this paper. The notation a ± b 
will be taken to mean the interval {a + xb : — 1 < x < 1}; distinct instances of ± used this 
way in the same expression will be treated independently, i.e. (a ± 6)(c± d) will be taken to 
mean {{b + X\c)(d + x^c) : —1 < X\, 22 < 1}- We will also write a = b±c instead of a G b±c. 

For a sequence of random variables A(1),A(2), . . . ,, we will use A ± to denote pairs of 
sequences of nonnegative random variables A + (l), A + (2), . . . and A~(l), A~(2), . . ., such that 

A{% + 1) - A(i) = A + (i) - A~(i). 

Similarly, for a differentiate function f(t), we will use / + and /~ to denote the positive and 
negative parts of fit). 

3.1 A density claim 

An important part of our argument will be showing that the maximum one-step change 
in the variables we track is sufficiently bounded. This will turn out to be straightforward 
for the number of open triples with respect to a given K, but to establish effective bounds 
on the maximum one-step decrease in the number of partial triples, we will appeal to a 
simple bound on the number of paths of length three between any two vertices in G(i). We 
mention that in the binomial random graph G(n,p), the expected number of such paths 
is (n — 2){n — 3)p 2 ~ 1, so the upper bound we establish below of n 1 / 4 is reasonable to 
expect. We mention that this bound is by no means optimal, but more than suffices for our 
arguments. 

Lemma 3. Let Vi be the event that, in G(i), for every pair of distinct vertices u and v, there 
are at most n 1//4 paths of length 3 between them. Then, conditioned on 7~ m , V m holds with 
high probability. 

To prove Lemma 3 we first establish a simple claim. 

Claim 1. Given distinct vertices u and v in a C^-free graph G, any two paths of length 3 
between u and v are edge-disjoint. 

Proof. Suppose (u, x, y, v), (u, x', y', v) are distinct paths from u to v in G that share an edge. 
If the shared edge is ux (so x = x'), then the vertices x, y, v, y' form a C4, a contradiction; 
similarly if the shared edge is vy. If the shared edge is xy, then, as the two paths are distinct, 
x' = y,y' = x, and the vertices u, y, v, x form a C4, again a contradiction. □ 

Proof of Lemma 3. We fix vertices u and v and bound the probability that there are n 1 ^ 
such paths between them in G(m). On any such path between u and v , for the last of the 
three edges added, e = e*, we must have Ci 6 C uv (i — 1) by definition. Conditioned on 
T m , for % = 1, . . . ,m, the probability that e« G C uv {i — 1) is at most \C uv (i — i)\/Q(i) < 
n e p~ x /n 2 ~ £ = n 2e /n 4 / 3 . By Claim 1, as any two such paths are edge-disjoint, it suffices 



s 



to bound the probability that e, G C uv (i — 1) for n 1//4 steps i: noting m < n e ■ n 4 / 3 for n 
sufficiently large, this is at most 



2e \ ™ 1/4 / 2e \ ™ 1/4 / 3e* " 1/4 

m \ I n \ I men \ I en 



< _i/A-A/ a < -T7T < exp(-n 



The result then follows from a union bound over the (™) choices of u and t>. □ 

As V m implies Vi for any i, < i < m, Lemma 3 and Theorem 2 imply Vi holds for all i, 
< i < m, with high probability. 

3.2 The differential equations method 

To show that our variables follow the conjectured trajectories, we appeal to an approach 
to the differential equations method presented in Lemma 7.3 from [3]. The only difference 
in the statement is the notation change of X^ A instead of Y^ A . We reproduce from [3] the 
setup for this lemma: suppose we have a stochastic graph process defined on [n], where n is 
large. Let r be a fixed positive integer, and for j G [r], let kj, Sj be parameters (which can 
depend on n). 

Suppose for each j G [r] and A G CC!), there is a sequence of random variables Xj tA (i), 
defined for i — 0, . . . , m and measurable with respect to the underlying graph process. 
Further, we suppose 



A',,(/ • 1)-.V,,(/) X:ji)-X jA 



where X£ A (i),Xj A {i) > 0. We relate these sequences to functions on [0, oo) by letting 
t = i/s for some function s = s(n) that tends to infinity. The goal is then to argue that, for 
some collection xj(t) of continuous functions, 

X jsA (i) « XjifySj 

for all j G [r] and A G (H) , i — . . . , m. We view 1 < j < r as the type of random variable, 
and the set A as giving its position in the graph. The parameter Sj is the size-scaling for 
the jth type of random variable. 

Lemma 4 ([3], Lemma 7.3). Let < e < 1 and c, C > be constants, and suppose for each 
j G [r] we have a parameter Sj(n) and functions Xj(t),ej(t),9j(t),jj(t) that are smooth and 
nonnegative for t > 0. For i* — 1, 2, . . . , m, let Qi* be the event that 



Sj J \ Sj 



for all 1 < i < i* , 1 < j < r, and A G (^')- Suppose there is also a decreasing sequence of 

events Hi, 1 < i < m, such that lim^oo Pr(l-L m \ Q m ) = 1, and that the following conditions 
hold: 



9 



1. (Trend hypothesis) When conditioning on Qi A "H,, we have 

hj(t)\ Sj 



for all j G [r] and A G where x^(t) and hj(t) are smooth nonnegative functions 

such that 

x'j(t) = x^(t) — x~(t) and hj(t) = (ejXj + 7j)'(i); 
(Boundedness hypothesis) For each j G [r], conditional on Qi A Hi, we have 

X U^ < -#7; 

J ' Sjkjn 6 

3. (Initial conditions) For all j G [I], we have 7j(0) = and Xj a{0) = SjXj(0) for all 

4- We have n 3e < s < m < n 2 , m < n £ l 2 s, s > A0Cs 2 kjn e , n 2e < Sj < n~ e s, 

^ j (t) + e j (t)x j (t)/2- lj (t)/2>c, 

POO 

sup \xf(t)\ < C, sup \x'At)\ <C, / \x"At)\ dt < C, 

t>0 t>0 Jo 

rm/s 

sup \hj(t)\ < n e , / \hj(t)\ dt < rf. 

0<t<m/s JO 

Then Pr[Q m A W m ] — > 1 as n -> 00. 



3.3 Additional inequalities and the constants fi, e, V, W 

As much of the remainder of this paper will be devoted to verifying the conditions of Lemma 
4, we take the opportunity now to gather a few simple inequalities. First, in addition to the 
constraints on fi, e, V and W implicit in [3], the following bounds suffice for our application: 

640e 2V 1 

V > 40 W > — — e< — 

~ 71og(2)' - 100' 



and jx is chosen sufficiently small so that e p ^ < n £ l 2 for all t G [0,t max ], provided n is 
sufficiently large. 

We observe that as t max = /x(logn) 1,/3 = o(n a ) for any a > 0, it follows that if F(t) is 
a fixed polynomial and a > 0, we have < n a on [0,t max ] for all n sufficiently large. 

To simplify some of our later calculations, we mention a few additional inequalities which 
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follow directly from our choice of the constants above, (l)-(4), and (10), for all t in [0,t max ] 
and n sufficiently large: 

1 < q{t)~ x < q\t)e p ^ < q(t)e p ^ < e p ^ < n e , 

q(t)s e > n 5e , and n 1/3 < k < n 1/3+£ . 

Furthermore, conditioned on the event 71*, < i* < m, by Theorem 2 we have 

Q(i) > n 2 ~ £ and \C uv {i)\ < n 2/3+£ 

for < i < i* and all uv eO{i)U C(i). 

Finally, we will repeatedly make use of the following simple lemma. 

Lemma 5. Suppose rj = rj(n) — > as n — > oo, and a, b are positive integers. Then, for n 
sufficiently large, 

1. (1 ± ar]){l ± brj) C (1 ± (a + b + 1)77). 

2. (1 iar/)" 1 C (1± (0 + 1)77). 

Proof. Both containments follow from if = 0(77), the latter from considering the series 
expansion of (1 + x)~ x . □ 

4 Proof of Lemma 1 

Our proof of Lemma 1 will follow from an application of Lemma 4. We recall s = s(n) = n 4 / 3 , 
and, for t > 0, we define 

x(t) = q(t) 2 /2, x + (t) = 0, x~{t) = c(t)q(t) = 24t 2 q(t) 2 , 

y(t) = 2tq(t), y + (t) = 2q(t), and y~{t) = 2tc{t) = A8t 3 q(t). 

so that x ± , y ± are nonnegative on [0, oo), x' = x + — x~ and y' = y + — y~ . 

For K G (v) and < i < m, let Ux,i denote the event that K is uncovered in G(i), and 
let Sx,i be the event 

&K,i = ^K,i A % A Vi , 

where 71 and Vi are defined in Theorem 2 and Lemma 3, respectively. We are only interested 
in ensuring bounds that hold with high probability for uncovered K at each step i, and, by 
Theorem 2 and Lemma 3, Pr (T m V V m ) = o(l), so it suffices to show the desired bounds 
hold with high probability for all K and i for which Sx,i holds. We will therefore apply 
Lemma 4 to a modified collection of random variables that follow the correct trajectory 
deterministically on the event Sx,i, which we define as follows. 
For i — 0, 1, . . . , m: 



X+(i 



_ (\X K (i + l)\X K (i)\ if £ K ,i holds, 
1 x + (t) ■ k 2 n/s otherwise, 
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and 



Similarly, let 



X K (i) 



\X K (i)\X K (i + l)\ if S K>i holds, 

• k 2 n/s otherwise, 

|X y (0)| + fc3 + fc 2 "~ fc2 if i = 

X K (z - 1) + X£(i) - Xx(i) otherwise. 

'\Y K (i + l)\Y K (i)\ if S K>i holds, 

y + (t) ■ k 2 np/s otherwise, 

'|Xkr(i)\ykr(iH-l)| if ^ holds, 

■ k 2 np/s otherwise, 



and 



|Vir(0)| if i = 0, 

Y K (i-l) + Y£(i) - Y K (i) otherwise. 

It follows that on the event £ K>i , X K (i) = \X K (i)\ + fc3+fc "~ fc2 » |X^(z)| and Y^(i) = |Y^(z) 



To set up our application, we recall m = m(n) = /iQogn) 1 / 3 ?^ 4 / 3 , and we let c = 1/4 and 
take C > to be a sufficiently large constant. We let k\ = k 2 = k, x\ = x, x 2 = y, and for 
Ke we let 

= X K (i) : Si = k 2 n, X 2 ,K(i) —Y K (i), and 5 2 = k 2 np. 

As = and = Y^, we will write the latter for ease of reading. 
We define, for t > 0, 

7(f) = - (1 - exp(-640e 2y • t)) and 9(t) = ^ + j(t). 

For j G {1,2} we define the remaining error parameters as 



Sj =n 



3f 



ej (t) = e(t), 7i(*) = 7(<) and 9 j {t) = 9{t), 



where e(t) is defined in (3). 

So, for < i* < m, Qi* is the event that 



X K (i) = (l ± ^ ± ^'j A 2 n and 



S2 / \ S 2 



(11) 



for all K E ( and < i < i*. We take the event Hi = Qi for all i, which trivially is 
decreasing and satisfies lirn^oo Pr(H m \Q m ) = 1. 
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The initial conditions follow easily: Y K (0) = = y(0)k 2 np, and 

**(0) = y (n - fc) + = — = x(0)k 2 n. 

We point out that as 6{t) < 3/4 and (k 3 + kn — k 2 )/2 = o(l) ■ k 2 n/si, the conclusions of 
Lemma 1 hold on the event Q m t\T m A P m , so as TJn A "Pm holds with high probability, it 
suffices to show Q m holds with high probability. 

We next note that, as intended, the trend and boundedness hypotheses follow determin- 
istically for and on the event Ek,% - the trend hypothesis is trivial. The boundedness 
hypothesis follows from the inequalities (which we will establish!) \x ± \ < C,\y ± \ < C and 
s > A0Cs 2 kjn £ for j G {1,2}. It therefore remains to show they hold when conditioned on 
Gi A 8k, i- 

4.1 Open triples 
4.1.1 Trend hypothesis 

As X K (i) = for all i, the trend hypothesis for X K follows, so we turn to X K . To simplify 
our calculations, all functions in the expressions which follow are assumed to be evaluated 
at t — t(i), and we will write q in place of q(t), etc.. To avoid potential confusion, we will 
use "e" to refer to the function defined in (3), and "e" to refer to the constant e = 2.718 . . .. 

Conditioned on £k,u a triple (uv,w) G X K {i) gets counted by X K (i) if and only if 
e i+ i G {uw,vw} U C uw {i) U C vw {i). As K is uncovered, it follows that uw C vw {i) and 
vice-versa, and therefore the probability of this occurring, conditioned on Qi A Ska, is 



\C uw {i) U C vw {i)\ +2 2(1 ± e/s e ){c ± 12/s e )p _1 /2 ± {n~ l/i p~ l + 2) 



Q{i) (l±e/s e )(q±l/s e )n 2 /2 

2(1 ± e/s e )(c ± (12/se + n" 1 / 4 + 2p))p~ 1 /2 



C 



C 



{l±e/s e )q(l±l/(qs e ))n 2 /2 
2(l±e/ Se )(c±13/s e ) 



:i±e/s e )g(l±l/(gs e )) 



C 2,l ± -|(l ± ^^ ± ^Vl ± AV^ 



c 



q qs e j v <?s e / n 2 



2 /"l ± - V- ± 13 + 2c /g + 26 /(^) ^| . 



s e J \q qs e J n 2 



S\ J \ q S\ J n 2 



where the last containment follows from c/q = 24t < n e and qs e > n si. Summing this 
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over all (uv,w) G X K {i) and using (11) yields 



E (XiWIft A S K:i ) = (l ± £) (, ± 1) ^ • (l ± ± I) 



C 



6e \ / 2a;c x + 2c/ q + l/s\ \ k 2 n 
s x ) \ q si 



6e\ /2a:c ± 48£ 2 + 2\ fc 2 n 



S\J \ q si 
'2a:c ± 12exc/g + 48t 2 + 2 + 6e(48t 2 + 2)/si^ A; 2 n 



9 «i 
c /2a;c 288t 2 a;e + 48t 2 + 3\ A; 2 
\ 9 si / s 

It remains to show that 288i 2 x(t)e(t) + 48t 2 + 3 < fci(t)/4, where ^i(t) = (are + 7)'(t): 
routine calculations yield 

h x = {-48t 2 x)e + xP'e p{t) + 7 ' 

> -48t 2 xe p « + W^(3t 2 + l)ze p W 

> I¥(t 2 + l)xe p(i) , 

as 48 < 2W. As the inequalities x(t)e(t) < x(t)e p(t) and 1 < 2x(f)e p(i) hold, it suffices to 
show 

(384t 2 + 6)x(t)e p ® < ^(t 2 + l)x(t)e p « 
which follows as W > 4 • 384 = 1544. 

4.1.2 Boundedness hypothesis 

As no new open triples are created in any step, the bound follows for trivially. Further- 
more, as every open pair in 0{i) lies in at most k triples in X K (i), conditioned on Q i A £ka 
we have 

\x~ K {i)\ < {\c ei+ M + 1 )- k ^ n2£ - n <^ = ^ 

as k > n 1 / 3 > n 9s for n sufficiently large. 

4.2 Partial triples 
4.2.1 Trend hypothesis 

We begin by establishing the bounds for E (Y^\Qi A Sk,i)- A triple (uv,w) G X K {i) enters 
Yx(i + 1) if and only if e*+i G {uw,vw}, which occurs with probability 



Q(i) (l±e/s e )(q±l/s e )n 2 /2 q \ s e ) \ qs e J n 



,2 
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Summing over the triples (uv,w) G X K (i) and using (11) yields 



EWWIftA^) = ( 1± ±)( I± i)^.?.( 1± |)( 1± A).| 



s l J \Q s l / V <? S e/ ™ 

si/ V<? s i / V s iJ s 

^^_Ae\ fx ^q^ 1 + x/q + q" 1 /s\\ k 2 np 



si J \q si 
4e\ /z (T 1 + 1^ Pnp 



si/ \q si 
( x Aexq^ 1 + q^ 1 + 1 + 4e(g _1 + l)/s]\ k 2 np 
\q si J s 

c /4x 8ge + 4g~ 1 + 8\ fc 2 np 
U s i / s 

where the last containment follows from the inequality 4e(g _1 + 1) < 8n 2e < si for n 
sufficiently large. 

It remains to show that 8q(t)e(t) + 4g(t)~ 1 + 8 < h 2 (t)/A, where h 2 (t) = (ye + j)'{t). 
Straightforward calculations yield 

h 2 = (2 - 48t 3 )ge + (2tq)W(3t 2 + l)e p(t) + i 

> 2W(t 3 + t)qe m + y. (12) 

To show the desired inequality, we consider two cases: t < V/W and t > V/W . If t < 
V/W < 1, then as t 3 + t < 2t, 

As g(t)e p W > qit)" 1 > 1, it follows that 8q(t)e(t) + 4g(fp 1 + 8 < 20g(t)e p W < 20e 2y . On 
the other hand, we have 

h 2 (t) > j{t) _ 640e 2y ■ e - 640e2V V4 > 640e 2y _ 2y 
4~4~ 4 ~32~ e ' 

as W > 640e 2y • V/ log(2). If t > V/W, then, as 7' is nonnegative, 



which suffices as V > 40. 
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Next, we turn to E (Y^\Qi A £K,i)'- for each triple (uv,w) E Yk{i), where without loss 
of generality uw G 0(i), the probability that (uv,w) gets counted by Y^{i) conditioned on 
Qi A £ K ,i is 

\C uw {i)\ + 1 (1 ± e/s e ){c ± 12/s e )j?-V2 + 1 



Q{i) (l±e/s e )(q±l/s e )n 2 /2 



± 



the last containment following from s e q > n e S2 and 2c/ q = 48t 2 < n e on [0,t max ]. 
Consequently, 

E (Y^ m a = (i ± ±) ( y ± 1) *v • (i ± g g ± i) £ 



c 



1 ± ^ f — ± y + c / qJr \ k np 
s 2 / U s 2 / s 



C 48t 3 q(t) ± 



S2j \ q s 2 
'yc Qeyc/q + 26t 2 + 1 + 6e(26t 2 + l)/s 2 \ & 2 np 

s 2 / s 

288t 3 qe + 26t 2 + 2\ k 2 np 

To establish the required bound, we first observe that, using t 2 <t 3 + t and g(t)e p(4) > 1 
for t > 0, we have 

288t 3 q(t)e(t) + 26t 2 + 2 < 314(t 3 + t)q(t)e p{t) + 2. 

As VP > 4(314) = 1256, we have 314(t 3 + t)q(t)e p ^ < f (t 3 + t)q(t)e p ^, so by (12) it suffices 
to show that 

Again considering the cases t < V/W and t > V/W separately, by the arguments given 
above we have Y(f)/4 > 20e 2y for t < V/W and f (t 3 + t)q(t)e p ^ > V/A for t > V/W, 
which suffices as V > 8. 



4.2.2 Boundedness hypothesis 

We recall that it suffices to show the boundedness hypothesis holds conditioned on QiA£x,i = 
Qi A Uk,i A % A Vi, which we assume throughout this subsection. We start with Y£: as a 
given open pair lies in at most k open triples, we have 

„l/3 q 

|Y+(t)| <k< k- l— = -g-. 

1 K w ' n 7e s 2 fcn £ 
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Turning to Y%, we recall that as K is uncovered, each partial triple (uv,w) G Yx(i) 
contains a unique open pair, which we will take to be uw without loss of generality. However, 
the trivial bound |Y^(z)| < |C ei+1 (i)| + 1 does not suffice, as for most steps i, \C ei+1 (i)\ = 
n 2/3+o(i)^ while the required upper bound S2/ (s^kn 6 ) = n 2 / 3 - 7e +°( 1 ) ; so we must be more 
careful. 

A triple (uv, w) G Y K {i) is counted by Y^(i) if and only if e i+ i G {uw}UC uw (i). Suppose 
ej + i = xy: we separately bound the number of partial triples (uv, w) removed from Yx(i) 
based on the intersection of the open pair uw with xy: trivially, at most one such triple has 
uw = xy, so let 

A\ = {(uv, w) G Yk(i) : uw G 0(i) D C xy (i), uw D xy = 0}, 

A 2 = {(uv, w) G Y K (i) : uw G 0(i) fl C xy (i), uw D xy = {w}}, and 

A3 = {(uv, w) G Y K (i) : uw G 0(i) fl C xy (i), uw Hxy = {u}}. 

Suppose first that (uv,w) G A\\ as uw G C xy (i), we have that either {ux, wy} C E'(i) 
or C -E(i): in the former K is uncovered we must have u as the unique 

neighbor of x in .fT and w as a neighbor of y, for which there at most A(G(i)) such choices. 
Analogous reasoning for the second case yields 

L4x| < 2A(G(i)) < 8fi(n log n) 1/3 . (13) 

Next, consider a (uv,w) G A 2 , and suppose first that x = w: then there is a vertex z 
such that {i/z, zu} C E(i). There are at most A(G(i)) choices of z adjacent to y, and fixing 
z, at most one choice of u G K; analogous reasoning for the case y = w yields 

\A 2 \ < 8/2(n\ogn) 1/3 . (14) 

Turning now to A3, we partition A3 into A3U/I3, where A' 3 contains those triples (uv , w) G 
A3 with u = x, and A'^ contains those triples with u = y. We first bound \A' 3 \: let 

B = {w G [n] : (uv, w) G A' 3 for some v G K}. 

As for all (uv,w) G A 3 we have u = x and as no open pair is contained in more than one 
partial triple, it follows that \B\ = \A' 3 \. Since for each w G B we also have uw G C xy (i), it 
follows that there exists a vertex z w such that {yz™, z w w} C -E(i). Let 

B' = {weB:z w (£K,yw(£ E(i)}. 

Claim 2. 

|£| - 2A(G(z)) < < A;n 1/4 . 

Proof. For the lower bound, it follows trivially that y has at most A(G(i)) neighbors in 
B. Similarly, if w,w' G B with z w ,z w i G K, then, as z^,^/ are neighbors of y and is 
uncovered, z w = z w >, so we have z w G K for at most A(G(z)) distinct w G -B. 

For the upper bound, we note that each w G -B' has exactly one neighbor in K \ {y}, 
and that z w ^ i^. It follows that there are at least \B'\ paths of length 3 in G(i) from y to 
K \ {y}, so for some v G K \ {y}, there are at least \B'\/k paths of length 3 from y to v. 
But as Vi holds, there are also at most n 1 / 4 such paths, and the result follows. □ 
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n 2 / 3 < S 2 
n le ~ slkn £ 



From Claim 2, 

|^| = \B\ < \B'\ + 2A(G(i)) < fcn 1 / 4 + 8/i(nlog(ri)) 1 / 3 , 
and applying analogous arguments to \A 3 \ lets us conclude 

\A 3 \ < 2kn 1/4 + 16//(ralog(n)) 1/3 . (15) 
Finally, combining (13)-(15) with the fact that (nlogn) 1 / 3 = o^kn 1 ^ 4 ), we have 
Yx(i) < 1 + \Ai\ + \A 2 \ + \A 3 \ < 2kn 1/4 + o{kn 1/4 ) < 3kn 1/4 
for n sufficiently large. As k > n 1//3 and 3k < n 1 / 3+e for n sufficiently large, therefore 

Y~(i) < n 7 ' 12+e < n*' 12 - l£ = ■— < 

provided e < 1/96, and the boundedness hypothesis is verified. 

4.3 Analytic considerations 

Here we verify the remaining inequalities from Part 4. of Lemma 4, recalling that we chose 
c = 1/4 and C sufficiently large. First, from (1), we have t max = /i(logn) 1 / 3 = o(n £ l 2 ) so 
for large n, we have n 3e < s < t max s = m < n £ / 2 s < n 2 . For any fixed constant C > 0, 
j G {1, 2}, and n sufficiently large, as kj < n 1 / 3+e , 

AOCsfan 6 < 40Cn 1/3+8e < n 1/3+9e < s. 

As x(t),y(t), e(t),j(t) are nonnegative, it follows from the definitions that, for j 6 {1, 2}, 

inf9j(J) + £iWfiW_IiW> inf I + 2W > i 

t>o n ' 2 2 _ t>o 2 2 4 

Next, we observe from the definitions of x^x^^y^y^ and straightforward differentiation 
that we can bound, for t > 0, lar^, j^l, \x"\, and above by a function of the 

form H(t) = F(t)e~ 8t , where F is a polynomial of degree 5 with nonnegative coefficients. 
It is straightforward to see that both sup <>0 H{t) and J °° H(t) are finite, so provided C is 
greater than the larger of these two, sup <>0 sup t>0 and J °° \x"(t)\ dt are all 

less than C. 

Finally, turning to the inequalities involving hj = {ejXj + 7j)', it is easy to see that 
sup t>0 |7'(i)| and / °°|7"(^)l are bounded. Calculations similar to those establishing lower 
bounds on h±, h 2 easily yield that Ke^x-,)'! and \{ejXj)"\ are bounded above by a function of 
the form F(t)e p ^\ where F is a polynomial of degree 5 with nonnegative coefficients. As 
F{t) and e p ^ are increasing, e P( ^ < (e p ^)', and m/s = t max , it follows that 

/ F{t)e m dt < F(t max )e p(w) = sup F{t)e p(t) . 
Jo o<t<t max 

As t max = /i(logn) 1 / 3 = o(n £ / 10 ), say, F{t max ) = o(n e/2 ), and as e PW < n e/2 on [0,t max }, 
the bounds sup < t < m / s \hj(t)\ < rf and f™ /s \tij(t)\ dt < rf easily follow, and the proof of 
Lemma 1 is complete. 
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